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Abstract 

This paper contributes to the area of inductive logic programming by presenting a new 
learning framework that allows the learning of weak constraints in Answer Set Program¬ 
ming (ASP). The framework, called Learning from Ordered Answer Sets, generalises our 
previous work on learning ASP programs without weak constraints, by considering a new 
notion of examples as ordered pairs of partial answer sets that exemplify which answer 
sets of a learned hypothesis (together with a given background knowledge) are preferred to 
others. In this new learning task inductive solutions are searched within a hypothesis space 
of normal rules, choice rules, and hard and weak constraints. We propose a new algorithm, 
ILASP2, which is sound and complete with respect to our new learning framework. We 
investigate its applicability to learning preferences in an interview scheduling problem and 
also demonstrate that when restricted to the task of learning ASP programs without weak 
constraints, ILASP2 can be much more efficient than our previously proposed system. 

KEYWORDS: Non-monotonic Inductive Logic Programming, Preference Learning, An¬ 
swer Set Programming 


1 Introduction 

Preference Learning has received much attention over the last decade from within 
the machine learning community. A popular approach to preference learning is 
learning to rank l(Fiirnkranz and Hiillermeier 20031 iGeisler et al. 200T]) , where the 
goal is to learn to rank any two objects given some examples of pairwise preferences 
(indicating that one object is preferred to another). Many of these approaches use 
traditional machine learning tools such as neural networks (jGeisler et al. 200lj) . 

On the other hand, the field of Inductive Logic Programming (ILP) ( |Muggleton 199l| 
has seen significant advances in recent years, not only with the development of sys¬ 
tems, such as QRay et al. 2004|lKimber et al. 2009~l|Corapi et al. 20i0||Muggleton et al. 2012[ 
Mug gleton and Lin 2013J, but also the proposals of new frameworks for learn¬ 
ing (jOtero 200 1: Saka maand Inoue 2009ULaw et al. 2014jl . In most approaches to 
ILP, a learning task consists of a background knowledge B and sets of positive 
and negative examples. The task is then to find a hypothesis that, together with 

* This research is partially funded by the 7th Framework EU-FET project 600792 “ALLOW 
Ensembles”, and the EPSRC project EP/K033522/1 “Privacy Dynamics”. 





















2 


M. Law, A. Russo, K. Broda 


B, covers all the positive examples but none of the negative examples. While 
in previous work ILP systems such as TILDE (iBlockeel and De Raedt 199811 and 
Aleph (ISrinivasan 2001[) have been applied to preference learning (IDastani et al. 200T1 
IHorvath 2012|) . this has addressed learning ratings, such as good , poor and bad , 
rather than rankings over the examples. Ratings are not expressive enough if we 
want to find an optimal solution as we may rate many objects as good when some are 
better than others. Answer Set Programming (ASP), on the other hand, allows the 
expression of preferences through weak constraints. In a usual application of ASP, 
one would write a logic program which has many answer sets, each corresponding 
to a solution of the problem. The program can also contain weak constraints (or 
optimisation statements) which impose an ordering on the answer sets. Modern 
ASP solvers, such as dingo (IGebser et al. 20TTTb can then find the optimal answer 
sets, which correspond to the optimal solutions of the problem. For instance, in a 
scheduling problem, we could define an ASP program, whose answer sets correspond 
to timetables, and weak constraints that represent preferences over these timetables 
(see (jBanbara et al. 20131) for an example application of ASP in timetabling). 

In this paper, we propose a new learning framework, Learning from Ordered 
Answer Sets ( ILPloas), that allows the learning of ASP programs with weak con¬ 
straints. This framework extends the notion of learning from answer sets proposed 
in (lLaw et al. 2014tl . where ASP programs without weak constraints were learned 
using only positive and negative examples of partial answer sets. In our new learning 
task ILPloas , additional examples are defined, as ordered pairs of partial answer 
sets, and the language bias captures a hypothesis space of ASP programs contain¬ 
ing normal rules, choice rules and hard and weak constraints. A new algorithm is 
presented and proved to be sound and complete with respect to ILP loas- 

To demonstrate the applicability of our framework, we consider, as a running 
example, an interview timetabling problem and the task of learning, as weak con¬ 
straints, academics’ preferences for scheduling undergraduate interviews. An aca¬ 
demic might be more comfortable interviewing for one course than another, might 
prefer not to have many interviews on the same day, or might hold both of these 
preferences but regard the former as more important. Given ordered pairs of partial 
timetables, our approach is able to learn these preferences as weak constraints. 

The paper is structured as follows. Our new learning framework, ILPloas is pre¬ 
sented in Section [3] It extends the notion of Learning from Answer Sets (lLaw et al. 2014}) 
to the new task of learning weak constraints. We discuss formal properties of the 
framework such as the complexity of deciding the existence of a solution. Our learn¬ 
ing algorithm ILASP2 is described in Section |4j together with experimental results 
based on a scheduling example (Section 0 . We also show that ILASP2 can have 
increased efficiency over our previous system when learning programs without weak 
constraints. Discussion on related and future work concludes the paper. 

2 Background 

In this section we introduce the concepts needed in the paper. Given any atoms 
h, hi ,..., h 0 , bi ,..., b n , ci,..., c m , h ■*— b\ ,..., b n , not c\ ,..., not c m is called a 
normal rule, with h as the head and b\ ,..., b n , not c\,, not c m (collectively) as 
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the body (“not” represents negation as failure); a rule <— bi,...,b n , not a,..., 
not c m is a hard constraint; a choice rule is a rule l{h \,..., h 0 }u <— b±,... ,b n , 
not ci,..., not c m (where l and u are integers) and its head is called an aggregate. 
A variable in a rule R is safe if it occurs in at least one positive literal in the body 
of R. A program P is assumed to be a finite set of normal rules, choice rules, and 
hard constraints. The Herbrand Base of P, denoted HBp, is the set of variable 
free (ground) atoms that can be formed from predicates and constants in P. The 
subsets of HBp are called the (Herbrand) interpretations of P. A ground aggregate 
l{hi ,..., h 0 }u is satisfied by an interpretation I iff l < \I (~l {hi,..., h 0 }\ < u. 

As we restrict our programs to sets of normal rules, (hard) constraints and choice 
rules, we can use the simplified definitions of the reduct for choice rules presented in 
(ILaw et al. 2015cl) . Given a program P and an Herbrand interpretation I C HBp , 
the reduct P 1 is constructed from the grounding of P in 4 steps: firstly, remove rules 
whose bodies contain the negation of an atom in I; secondly, remove all negative 
literals from the remaining rules; thirdly, replace the head of any hard constraint, 
or any choice rule whose head is not satisfied by I with _L (where _L ^ HBp); and 
finally, replace any remaining choice rule { hi ,..., h m } 4— bi,...,b n with the set of 
rules {hi <- h,... ,b n \ hi €. in {hi ,..., h m }}. Any I C HBp is an answer set of P 
if it is the minimal model of the reduct P 1 . Throughout the paper we denote the 
set of answer sets of a program P with AS(P). 

Unlike hard constraints in ASP, weak constraints do not affect what is, or is not, 
an answer set of a program P. Hence the above definitions also apply to programs 
with weak constraints. Weak constraints create an ordering over AS(P) specifying 
which answer sets are “better” than others. The set of optimal (best) answer sets of 
P is denoted as AS*(P). A weak constraint is of the form bi ,..., b n , not Ci,..., 
not c m • [w@Z, ti ,..., t 0 ] where bi ,..., b n , ci,..., c m are atoms, w and l are terms 
specifying the weight and the level , and t\,..., t 0 are terms. A weak constraint W 
is safe if every variable in W occurs in at least one positive literal in the body 
of W. At each priority level l, the aim is to discard any answer set which does 
not minimise the sum of the weights of the ground weak constraints (with level l) 
whose bodies are true. The higher levels are minimised first. Terms specify which 
ground weak constraints should be considered unique. For any program P and 
A G AS(P), weak(P , A) is the set of tuples (w, l,ti,..., t 0 ) for which there is some 
b n , not ci,..., not c m ■ [w@Z, ti,...,t 0 \ in the grounding of P such that 
A satisfies bi ,..., b n , not ci,...,not c m . 

We now give the semantics for weak constraints (iCalimeri et al. 2013) . For each 
level l, P l A = T,( w yt 1 } ...,t 0 )eweak(p,A) w - For A h A 2 e AS(P), A i dominates A 2 
(written Ai )~p A 2 ) iff 3/ such that P Ai < P A2 and \/m > l, P Ai = P A2 - An answer 
set AgAS(P) is optimal if it is not dominated by any A 2 gAS(P). 

Example 1 

Let P be the program consisting of slot(m, 1); slot(m, 2); slot(t, 1); slot(t,2); and 
0{assign(D, S)}1 <— slot(D, S), which assigns 0 to 4 slots in a schedule (slot(m, 1) 
represents slot 1 on Monday). Let Wi, W 2 and W 3 be the weak constraints 

assign(D, S). [l@l], assign(D, S). [l@l, D] and assign(D, S). [l@l, D, S] respec- 
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tively. Applying each weak constraint to P gives as its optimal answer set the 
one in which no slots are assigned. The remaining answer sets are ordered in the 
following way: W\ considers all schedules in which slots have been assigned to be 
equally optimal, as there is only one unique set of terms fi,..., t n which is the 
empty set; Wi minimises the number of days in which slots have been assigned, as 
there is one unique set of terms per day; and finally, W 3 minimises the number of 
assignments made, as each combination of day and slot has a unique set of terms. 

In an ILP task, the hypothesis space is often characterised by mode declara¬ 
tions ( [Muggleton et al. 2012[ ). A mode bias can be defined as a pair of sets of 
mode declarations (Mh,Mb), where Mh (resp. Mb) are the head (resp. body) dec¬ 
larations. Each mode declaration m £ Mh , or m £ Mb , is a literal whose abstracted 
arguments are either v or c. An atom a is compatible with a mode declaration 
m if replacing the instances of v in m by variables, and the instances of c by 
constants yields a. The search space is defined to be the set of rules of the form 
h<— bi ,..., b n , not ci,..., not c m where (i) h is empty, h is an atom compatible 
with some m £ Mh, or h is an aggregate l {hr ,..., hk}u such that 0 < l < u < k 
and Vz £ [1, k] hi is compatible with some m £ Mh; (ii) Vz £ [1, n], Vj £ [1, m] bi 
and Cj are each compatible with at least one mode declaration in Mb; and finally 
(iii) all variables in the rule are safe. We require the rules to be safe because ASP 
solvers such as dingo (IGebser et al. 20111 ) have this requirement. We denote the 
search space defined by a given mode bias (Mh, Mb) as SLAs(Mh , Mb). 

In (jLaw et al. 20141) . we presented a new learning task, Learning from Answer 
Sets ( ILPlas ) which used partial interpretations as examples. A partial interpreta¬ 
tion e is a pair (e mc , e exc ) of sets of ground atoms, called inclusions and exclusions. 
An answer set A is said to extend e if and only if (e mc C A) /\(e exc C\ A = 0). Given 
partial interpretations ei and ei, ei extends ei iff e^ nc C e[ nc and e| xc C e{ xc . 

Definition 1 

A Learning from Answer Sets task is a tuple T = (B , SLAs(Mh, Mb), E + ,E~) where 
B is the background knowledge, SLAs(Mh,Mb) is the search space defined by a 
bias (Mh,Mb), E + and E~ are sets of partial interpretations called the positive 
and negative examples. A hypothesis H is in ILPlas(T), the set of all inductive 
solutions of T , if and only if H C SLAs(Mh, Mb); Ve + £ E + 3 A £ AS(B U H) such 
that A extends e + ; and finally, Ve - GE~ $Ag AS(B U H) such that A extends e~. 

The task of ILP is usually to find optimal hypotheses, where optimality is of¬ 
ten defined by the number of literals in a hypothesis. In ILP las, aggregates are 
converted to disjunctions before the literals are counted, giving a higher “cost” for 
learning an aggregate; for example, 1 {p, q} 2 is converted to (p A not q) V (q A 
not p) V (p A q) giving a length of 6. ILPlas aims at learning ASP programs con¬ 
sisting of normal rules, choice rules and hard constraints. This paper extends this 
notion to a new learning task capable of learning weak constraints. 

3 Learning from Ordered Answer Sets 

To learn weak constraints we extend the notion of mode bias with two new sets of 
mode declarations: M 0 specifies what is allowed to appear in the body of a weak 
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constraint; whereas M w specifies what is allowed to appear as a weight. A positive 
integer l max is also given to indicate the number of levels that can occur in H. 

Definition 2 

A mode bias with ordering is a triple M = {Mh, Mb, M a , M w , lmax), where Mh and 
Mb are respectively head and body declarations, M a is a set of mode declarations 
for body literals in weak constraints, M w is a set of integers and l max is a positive 
integer. The search space Sm is the set of rules R that satisfy one of the conditions: 

• R G SLAs(M h , Mb). 

• R is a safe weak constraint&i,..., 6^, not &i+i,..., not bj . [w@Z, ti ,..., t n ] 

such that Vfce [1, j\ bk is compatible with M 0 ; t \,..., t n is the set of terms in 
bl,...,bj, W €: M w , l [0, lmax ) • 

Note that even if we were to extend the learning task in Definition |T] with this 
new notion of mode bias, such a task would never have as its optimal solution 
a hypothesis which contains a weak constraint. This is because a Learning from 
Answer Sets task has only examples of what is, or is not, an answer set. Any 
solution containing a weak constraint W will have the same answer sets without 
W, and would be more optimal. We now define the notion of ordering examples. 

Definition 3 

An ordering example is a tuple o = (e i, ef) where e\ and e 2 are partial interpreta¬ 
tions. An ASP program P bravely respects o iff 3Ai, A 2 &AS{P) such that A- t ex¬ 
tends ei, A 2 extends e 2 and A\ >-p A 2 . P cautiously respects o iff VAi, A 2 G AS(P) 
such that A\ extends e\ and A 2 extends e 2 , it is the case that A\ )~p A 2 . 

Example 2 

Consider the partial interpretations e\ = ({assign(m, 1), assign(m, 2)}, {assign(t, 1), 
assign(t, 2)}} and e 2 = ({assignfm, 1), assign(t, 1)}, 0). Let o = (ei, e 2 ) be an or¬ 
dering example and recall P and W \,..., W 3 from example [T) The only answer 
set of P that extends e\ is m\m 2 (where m\m 2 denotes {assign(m, 1), assign(m, 2)}), 
whereas the answer sets that extend e 2 are m\ti,m\m 2 ti,mim 2 tit 2 and m\t\t 2 . 
PC W\ does not bravely or cautiously respect o as it gives to all these answer sets 
the same optimality; P U W 2 both bravely and cautiously respects o, as each pair 
of answer sets extending the partial interpretations is ordered correctly (i.e. answer 
sets extending e\ have slots allocated in only one day whereas all the answer sets 
extending e 2 have slots assigned in two days). Finally, PUW 3 respects o bravely but 
not cautiously (the pair of answer sets m\m 2 and m\ <1 is such that m\m 2 )f-p m\ti). 

We can now define the notion of Learning from Ordered Answer Sets ( ILP^oas )■ 

Definition f 

A Learning from Ordered Answer Sets task is a tuple T = ( B , Sm, E + , E~ , O b , O c ) 
where B is an ASP program, called the background knowledge, Sm is the search 
space defined by a mode bias with ordering M, E + and E~ are sets of partial 
interpretations called, respectively, positive and negative examples, and O b and 
O c are sets of ordering examples over E + called brave and cautious orderings. A 
hypothesis H C Sm is in ILPloas(T), the inductive solutions of T, if and only if: 
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1. Let Mh and Mb be as in M and H' be the subset of H with no weak con¬ 
straints. H' € ILP las {(B, S LA s(M h , M b ), E + , E~)) 

2. Vo e O b B U H bravely respects o 

3. Vo G O c B U H cautiously respects o 

The notion of an optimal inductive solution of an ILPloas task is the same as 
in ILP L aS: with each weak constraint W counted as the number of literals in the 
body of W. Note that the orderings are only over E + rather than orderings over 
any arbitrary partial interpretations. We chose to make this restriction as we could 
not see a reason why a hypothesis would need to respect orderings which are not 
extended by any pair of answer sets oi B Li H. Note also that in the case where O b 
and O c are both empty, the task reduces to an ILPlas task. 

Example 3 

Consider the ILPloas task T with the following background knowledge: 
slot(m, 1..3). slot(t, 1..3).slot(w,1.. 3). 
neq(l, 2) ,neq(l, 3) .neq(2, 1) .neq(2, 3) .neq(3, 1) .neq(3, 2). 
neq(m, t). neq(m, w) .neq(t, m) .neq(t, w) .neq(w, m) .neq(w, t). 
type(m, 1, cl) .type(m, 2, c2) .type(m, 3, c2) .type(t, 1, c2). 
type(t, 2, c2) .type(t, 3, c2) .type(w, 1, c2) .type(w, 2, cl) .type(w, 3, c2). 
0{assign(X, Y)}1:-slot(X, Y). 

Using the notation from example [2j let T have the positive examples e\ = 
($,m 2 m 3 t 3 t 3 w\w 2 ), e 2 = (mira 2 ,0), e 3 = (0, mit 2 wiw 2 ), e 4 = (ht 2 t 3 ,®), e 5 = 
(m 2 m 3 tit 2 t 3 wi w 3l (/)); e 3 = ( m\W\w 3 , m 2 m 3 t\t 2 t 3 w 2 )] two cautious orderings: (ei, e 2 ) 
and (e 3 ,e 4 ); and one brave ordering: (e 3 ,e 3 ). Consider Sm to be defined by the 
mode declarations: Mh = Mb = 0; M a = {assignfv, v), neq(v, v), type(v, c)}; M w = 
{ — 1,1}; and l max = 2. Note that as each positive example is already covered by 
the background knowledge and there are no negative examples, it remains to find 
a set of weak constraints which meet conditions 2 and 3 of definition |4] One in¬ 
ductive solution H of T is {:~ assign(D, SI), assign(D, S2),neq(Sl, S2) . [l@l, D, SI, S2] 
:~ assign(D, S),type(D, S, cl). [l@2, D, S]}; this respects the first cautious ordering ex¬ 
ample because any timetable extending e\ has at most one cl course whereas e 2 has 
at least one, so e± is better or equal to e 2 on the highest priority weak constraint; 
even if they are equal, a timetable extending e\ has at most one assignment per day 
and is, therefore, always better on the lower priority weak constraint. H also re¬ 
spects the other cautious ordering and the timetables m 2 m 3 t\t 2 t 3 wiw 3 and m\W\w 3 
correspond to answer sets which demostrate that the brave ordering is respected. 

In fact, there is no shorter hypothesis which meets conditions 1 to 3 and so H 
is an optimal inductive solution; moreover, the other optimal solutions are equiv¬ 
alent hypotheses such as: {:~ assign(D, Sl),assign(D, S2), neq(Sl, S2). [l@l, D, SI, S2]; 
:~ assign(D, S),not type(D, S, c2). [l@2, D, S]}. These hypotheses represent the prefer¬ 
ences described in the introduction. They express that the highest priority is to 
minimise the interviews for cl, and then to minimise the slots in any one day. 

We now discuss some of the formal properties of ILPloas- All learning tasks in 
the rest of this section are assumed to be propositional (B and Sm are both ground). 
The proofs for Theorems [T| to [3] can be found in (ILaw et al. 2015bl) . Theorems [T) 
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and [ 2 ] state sufficient and necessary conditions for there to exist solutions for an 
ILPloas task with an unrestricted search space (hypotheses can be any set of 
normal rules, choice rules and hard and weak constraints). 

Theorem 1 

Let T be the ILPloas task (B, E + , E ~, O b , O c ). The following conditions (in 
conjunction) are sufficient for there to exist solutions of T: (i) Ve G E + , there is 
at least one model of B which extends e; (ii) Vei G E + , $e 2 G ( E + U E~) such 
that e\ extends ei\ (hi) there is no cyclic chain of ordering examples (in O b U O c ) 
(ei, e 2 ), (e 2 , 63 ),..., (e„_i, e„), (e n , ef). 


Theorem 2 

Let T be the ILPloas task (B, E + , E~ , O b , O c ). The following conditions are 
necessary for there to exist solutions of T: (i) Ve € E + , there is at least one model 
of B which extends e; (ii) Vei G E + , $e 2 € E~ such that e\ extends e 2 ; (iii) there 
is no cyclic chain of cautious orderings, (ei, e 2 ), (e 2 , 63 ),..., (e„_i, e n ), (e n , e\). 

Note that if we consider the usual setting where hypotheses come from a search 
space, the conditions in theorem[ 2 ]are still necessary, but the conditions in theorem|T] 
are no longer sufficient as, even if the conditions hold, the search space may be too 
restrictive. Theorem [3] states the complexity of deciding the existence of solutions 
for both ILPlas and ILPloas tasks. The interesting property here is that deciding 
the existence of solutions for ILPloas is in the same complexity class as ILPlas- 

Theorem 3 

Let T be any ILPlas or ILPloas task. Deciding whether T has at least one 
inductive solution is NP np - complete. 

4 Algorithm 

We now describe our new algorithm, ILASP2 , capable of computing inductive so¬ 
lutions of any ILPloas task, and present its soundness and completeness results 
with respect to the notion of Learning from Ordered Answer Sets task given in 
Definition |4l We omit the proofs of the theorems in this paper, but they can be 
found in full in (ILaw et al. 2015bl) . For details of how to download and use our 
prototype implementation of the ILASP2 algorithm, see (ILaw et al. 2015al) . 

ILASP2 extends the concepts of positive and violating hypotheses, first intro¬ 
duced in our previous algorithm ILASP (ILaw et al. 20141) . to cater for the new 
notion of ordering examples. A hypothesis is said to be positive if it covers all 
positive examples and bravely respects all the brave ordering examples. A positive 
hypothesis is defined to be violating if it covers at least one negative example or 
if it does not respect at least one of the cautious ordering examples. These two 
notions are formalised by Definitions [5] and [G] 

Definition 5 

Let T = (B , Sm, E + , E~ , O b , O c ) be an ILPloas task. Any H C Sm is a positive 
hypothesis iff Ve G E + 3A G AS(B U H ) such that A extends e, and Vo G O b 
HUB bravely respects o. The set of positive hypotheses of T is denoted V(T). 
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Definition 6 

A positive hypothesis H is a violating hypothesis of T = (B, Sm, E + , E ~, 0 b , 0 C }, 
written H £V{T), iff at least one of the following cases is true: 

• 3e~ £E~ and 3A£ AS(B U H) such that A extends e~. In this case we call 
A a violating interpretation of T and write ( H , A) £ VT(T). 

• 3Ai, A 2 £AS(B\JH) and 3(ei, e 2 ) £ O c such that A\ extends e\, A 2 extends 
e 2 and A\ A 2 with respect to B U H. In this case, we call (Ai,A 2 ) a 
violating pair of T and write ( H , {A\, A 2 )) £ VV{T). 

Example 4 

Consider an ILPloas task with B equal to P in Example [T| but with one additonal 
fact busy(l, 1 ); positive examples e+ = ({assign(t, 1 ), assign(t, 2 )}, {assign(m, 2 )}) 
and e 2 = ({assign(m, 2 ), assign(t, 1 )}, 0); one negative example efi = ({assign(m, 1 )}, 
0 ); and one cautious ordering (e^ , e£). Sm is unrestricted (hypotheses can be con¬ 
structed from any predicate that appears in B and E). Three example hypotheses 
are given below. Note that when we describe answer sets we omit the facts in B. 

Hi = %£V(T) as e + and e% are covered and O b is empty; however, Hi £ V(T) 
for two reasons: firstly it has a violating interpretation ({assign(m, 1)}); secondly it 
has a violating pair (({assign(t, 1), assign(t, 2)}, {assign(m, 2), assign(t, 1)})). 

H 2 = {-<— busy(D, S), assign(D, S)} £V{T). H 2 has no violating interpretations, but 
it has a violating pair (({assign(t, 1), assign(t, 2)}, {assign(m, 2), assign(t, 1)})). 

H 3 = H 2 U{:~ assign(D, S).[1@1,D]} £V(T). H 3 £ V(T), as it lias no violating 
interpretations and its weak constraints minimise the days assigned (so it cautiously 
respects the ordering example). It is, therefore, an inductive solution of the task. 

One approach to computing the inductive solutions of an ILPloas task would be 
to extend the original ILASP method with our new notions of positive and violating 
hypotheses. Given a positive integer n , ILASP worked by constructing an ASP rneta 
representation of an ILPlas task T , called the task program 7 1 ™ et0 , whose answer 
sets could be mapped back to the positive hypotheses of T of length n. T^ eta could 
then be augmented with an extra constraint so that its answer sets corresponded 
exactly to the violating hypotheses of length n. ILASP first computed the violating 
hypotheses of length n , and then converted each of these to a constraint at the meta¬ 
level (ruling out that hypothesis). When T^ eta was then augmented with these new 
constraints, its answer sets corresponded exactly to the positive hypotheses which 
were not computed the first time - the inductive solutions of length n. 

The problem is that, in general, there can be many violating hypotheses which 
are shorter than the first inductive hypothesis and ILASP will compute all of them 
and add them into the task program as individual constraints. This scalability issue 
would be worsened if we were considering adding weak constraints to the search 
space. To overcome this, ILASP 2 adopts a different strategy: it eliminates classes 
of hypothesis, i.e. hypotheses that are violating for the same “ reason ”, namely they 
give rise to a particular violating interpretation or a particular violating pair of 
interpretations. The idea underlying the ILASP2 algorithm is to make use of two 
sets VI and VP which accumulate, respectively, violating interpretations and vi¬ 
olating pairs of interpretations that are constructed during the search. We start 
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initially with two empty sets VI and VP and continually compute the set of opti¬ 
mal remaining hypotheses which do not violate any of the reasons in VI or VP. If 
a computed hypothesis gives rise to a new violating interpretation then this inter¬ 
pretation is added to VI, if it gives rise to a new violating pair of interpretations 
then this pair is added to VP. If no optimal remaining hypotheses are violating, 
then these hypotheses are the optimal inductive solutions of the task. 

Definition 7 

Let T be an ILPloas task, VI and VP (resp.) be sets of violating interpretations 
and pairs of interpretations, and B be the background knowledge. Any H £ V{T) 
is a remaining hypothesis of T with respect to VI U VP iff VI H AS{B U H) = 0 
and V(/i, I 2 ) £ VP if I\,I 2 £ AS{B U H) then I\ Pb\jh h- A remaining hypothesis 
H is a remaining violating hypothesis iff 31? such that (H, R) £ VI( T) U W( T). 

We use an ASP meta-level representation to solve our search for remaining hypothe¬ 
ses. As we rule out classes of hypothesis at the same time (rather than using one 
constraint per violating hypothesis), our meta-level representation is slightly more 
complex than that used in the original ILASP. Due to this complexity, we define 
this representation in the online appendix and give here the underlying intuition. 

The intuition of our meta encoding is that for a given task T, we construct an ASP 
program T meta whose answer sets can be mapped back to the positive hypotheses 
of T. Given an answer set A of T meta we write (A) to denote the hypothesis 
represented by A. Each positive hypothesis may be represented by many answer sets 
of T meta but if this hypothesis gives rise to a violating interpretation, then at least 
one of these answer sets will contain a special atom vji. If the hypothesis gives rise 
to a violating pair of interpretations then at least one of the answer sets of T meta 
representing the hypothesis will contain a special atom vjp(t\,t 2 ), where (t\, t 2 ) is 
a pair of identifiers corresponding to the cautious ordering example which is being 
violated. There is only one priority level in T meta and the optimality of its answer 
sets is 2 * \H\ + 1 if the answer set does not contain the atom violating (violating 
is defined to be true if and only if vJ or at least one v.p{t\, t 2 ) is true) and 2 * \H\ 
if it does. This means that for any hypothesis H, the answer sets corresponding to 
H that do contain violating are preferred to those which do not. 

We can use T meta to find optimal positive hypotheses of T. If these positive solu¬ 
tions are violating, then the optimal answer sets will contain violating. We can then 
rule these hypotheses out. We can extract violating interpretations and violating 
pairs of interpretations from answer sets of T meta , using the functions M.~l and 
respectively. Violating interpretations and violating pairs of interpretations 
are both called violating reasons. For any set of violating reasons VR = VIU VP, we 
then have a second meta encoding VR me t a {T ) which, when added to T me t a , rules 
out any hypotheses which are violating for a reason already in VR. This means that 
the answer sets of T meta U VR me ta(T) will represent the set of remaining hypotheses 
of T with respect to VR. These properties are guaranteed by Theorem |U 

Theorem 4 

Given an ILPloas task and a set of violating reasons VR, let AS be the set of 
optimal answer sets of T meta U VR me t a {T). 
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• If 3A £ AS such that violating G A then the set of optimal remaining violating 
hypotheses VH is non empty and is equal to the set {M.J iyp (A) \ A G A5}. 

• If no A G .45 contains violating, then the set of optimal remaining hypotheses 
(none of which is violating) is equal to the set {Ai^y p (A) | A G .45}. 


Algorithm 1 ILASP2 
procedure ILASP2(T) 

VH = 0 

Solution — Solve(T met a U U5 me £ a (T)) 

while solution ^ nil && solution. optimality%2 == 0 do 
A = solution. answer set 

if V-i G A then 

VR+= MZt{A) 

else if 3fi, such that V-p(t\, £ 2 ) G A then 
VR+= M~p(A) 

end if 

Solution — Solve(T meta U VRmetai.'L')') 

end while 

return {M^ p (A) \ AeAS*(T meta U VR meta {T))} 

end procedure 

Algorithm [I] is the pseudo code of our algorithm ILASP2. It makes use of our meta 
encodings T meta and VR me ta{T). For any program P, solve(P) is a function which, 
in the case that P is satisfiable, returns a pair consisting of an optimal answer 
set together with its optimality (as there is only one priority level in our meta 
encoding this is treated as an integer); if P is unsatisfiable then solve(P) returns 
nil. While there are optimal remaining violating hypotheses, ILASP2 finds them 
and records the appropriate violating reasons. When there are no optimal remaining 
hypotheses which are violating then either the meta program will be unsatisfiable 
or the optimality of the optimal answer sets will be odd (as the optimality of 
any A G AS(T meta U VR meta (T)) is 2 * \M^y p (A)\ if A contains violating and 
2 = 1 = \M.~^y p {A)\ + 1 if not), and so ILASP2 stops and returns the set of optimal 
remaining hypotheses. Theorem [5] shows that ILASP2 is sound and complete with 
respect to the optimal inductive solutions of an ILPloas task. This result relies on 
the termination of ILASP2(T), which is guaranteed if B U Sm grounds finitely. 

Theorem 5 

Let T be an ILPloas task. If ILASP2(T) terminates, then ILASP2(T) returns 
the set of optimal inductive solutions of ILPloas(T)- 

5 Experiments 

Although there are benchmarks for ASP solvers (IDenecker et al. 2009 j) . there are 
no benchmarks for learning ASP programs. In (ILaw et al. 20141) we discussed the 
example of learning an ASP program with no weak constraints, representing the 
rules of sudoku. Using the examples from the paper and a small search space with 
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only 283 rules, the original ILASP algorithm takes 486.2s to solve the task. This 
is due to the scalability issues discussed in section [4] as there are 332437 violating 
hypotheses found before the first inductive solution. For the same task with ILASP2, 
there are only 9 violating reasons found before the first inductive solution, meaning 
that ILASP2 takes only 0.69s to solve the task. 

As this is the first work on learning weak constraints, there are no existing bench¬ 
marks suitable for testing our approach of learning from ordered answer sets. We 
have, therefore, further investigated the interview scheduling example discussed 
throughout the paper. Our experiments, in particular, test whether ILPloas can 
successfully learn weak constraints from examples of brave and cautious orderings. 
For the purpose of presentation, we assume our hypothesis space, Sm , to be defined 
by the mode declarations: Mh = Mb =0; M a = {assign(v, v), neq(v, v), type(v, c)}; 
M w = { — 1,1}; and finally, l max = 2. We place several restrictions on the search 
space in order to remove equivalent rules. The size of Sm is 184 (our hypotheses 
can be any subset of these 184 rules, so even considering only hypotheses with up 
to 3 rules this gives over a million different hypotheses). The learning task uses 
background knowledge B from Example [3] As Sm only contains weak constraints, 
for any H C Sm, AS(B U H) = AS(B). The learning tasks described in these 
experiments therefore correspond to learning to rank the answer sets of B. 


Hypothesis predictive accuracy 



Number of examples 


Hypothesis predictive accuracy 



Fullness of examples (%) 


(a) 


(b) 


Fig. 1: Accuracy with varying (a) numbers of examples; (b) fullness of examples 


For each experiment we randomly selected 100 hypotheses, each with between 
1 to 3 weak constraints from Sm, omitting hypotheses that ranked all answer sets 
equally. The only atoms that vary in B are the assign’s. As there are 9 different 
slots, there are 2 9 answer sets of B (and many more partial interpretations which are 
extended by these answer sets). We say an example partial interpretation is full if it 
specifies the truth value of all 9 assign atoms, otherwise we describe the fullness as 
the percentage of the 9 atoms which are specified. In both experiments (for each of 
the 100 target hypotheses Ht ), we generated ordered pairs of partial interpretations 
o = (ei, e-i) such that o was bravely respected. If o was also cautiously respected, 
then it was given as a cautious example (otherwise it was used as a brave example). 
In our first experiment we investigated the effect of varying the number of examples, 
and in the second we investigated the effects of varying the fullness of the examples. 
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Fig. 2: Average running time of ILASP2 with varying numbers of examples 


In both experiments, we tested our approach 20 times for each target hypothesis 
Ht- Each time, we used ILASP 2 to learn a hypothesis Hi which covered all exam¬ 
ples. We then calculated the accuracy of Hl in predicting the pairwise ordering of 
answer sets in B (for each pair of answer sets A 1; A 2 € AS(B) we tested whether 
Ht and Hi agreed on the preference between them). 

In our first experiment we investigated the effect of varying the number of exam¬ 
ples from 0 to 20. The examples were of random fullness, each with between 5 to 
9 assign atoms specified. Figure [1] a) shows the average predictive accuracy. Each 
point on the graph corresponds to 2000 learning tasks (100 target hypotheses with 
20 different sets of examples). The error bars on the graph show the standard error. 
The results show that our method achieves 90% accuracy for this experiment with 
around 10 or more random examples. 

For our second experiment we again tested our approach on 100 randomly gener¬ 
ated hypotheses with 20 different sets of randomly generated examples. This time, 
however, we have kept the number of examples fixed at 5, 10 and 20 and varied the 
fullness of the examples. Results are shown in Figure [IJb). The graph shows that 
examples are only useful if they are more than 50% full. One interesting point to 
note is that the peak performance is with examples of around 90% fullness. This 
is because cautious ordering examples are actually more useful if they are less full 
(as there are more pairs which extend them); however, orderings are less likely to 
be cautiously respected when they are less full. 

In our final experiment, we investigated the scalability of ILASP2 by increasing 
both the number of days in our timetable and the number of examples. Figure [2] 
shows the average running time for ILASP2 with 3, 4 and 5 day timetables (each 
with 3 slots) with up to 120 ordering examples. The learning tasks are targeted at 
learning the hypothesis from Example[3l We randomly generated ordering examples, 
as in the previous experiments with the slight difference that the fullness of the 
examples was unrestricted. As the hypothesis in these experiments does not use 
negative weights in either of the weak constraints, we also tested the average running 
time with a search space containing only positive weights. This means that Sm 
contained 92 weak constraints rather than the original 184. These experiments 
show that the time taken to solve an ILASP2 task is dependent not only on the 
number of examples, but also on the size of the domain and the size of Sm- 
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6 Related Work 

In (lLaw et al. 2014[) we showed that any of the learning tasks in 
|Ray 2009] ISakama and Inoue 20091 IQtero 200111 could be expressed by ILPlas 
and computed by ILASP. As any ILPlas task can be (trivially) mapped into 
an ILPloas (he. O b = 0 and O c = 0), ILPloas inherits this property. None of 
the previous learning tasks (including ILPlas ), however, can construct examples 
which incentivise the learning of a hypothesis containing a weak constraint. This is 
because they can only give examples of what should (or shouldn’t) be an answer 
set of I? UR. In addition, ILPloas inherits the capability of ILPlas of supporting 
predicate invention, allowing new concepts to be invented whilst learning. 

The ILASP2 algorithm is an extension of the original ILASP algorithm in (|Law et al. 20141) 
It extends the concepts of positive and violating hypothesis to cover learning weak 
constraints (which was not possible in ILASP). For the simpler ILPlas tasks, 

ILASP2 is more efficient than ILASP. As discussed in section U the original ILASP 
algorithm has some scalability issues when there is a large number of violating hy¬ 
potheses. We have shown in section [5] that by eliminating violating reasons rather 
than single violating hypotheses, ILASP2 can be much more efficient. 

Also related to our work are existing approaches for learning to rank. These use 
non logic-based machine learning techniques (e.g. neural networks (IGeisler et al. 20(711) 1 
Our approach shares the same advantages as any ILP approach versus a non logic- 
based machine learning technique: learned hypotheses are structured, human read¬ 
able and can express relational concepts such as minimising the instances of par¬ 
ticular combinations of predicates. Existing background knowledge can be taken 
into account to capture predefining concepts and the search can be steered to¬ 
wards particular types of hypotheses using a language bias. Furthermore, ILASP2 
is also capable of learning preferences with weights and priorities, meaning that 
more structured and complex preferences can be learned. 

An example of the use of an ILP system for learning constraints has been recently 
presented in (lLallouet et al. 20101) where timetabling constraints are learned from 
positive and negative examples. In this case the learned rules are hard constraints 
(e.g., enforcing that a teacher is not in two places at once). Examples of this kind 
are already computable by ILPlas , and so are also computable by ILPloas- 

7 Conclusion and Future Work 

We have presented a new framework for ILP, Learning from Ordered Answer Sets, 
which extends previous ILP systems in that it is able to learn weak constraints and 
can be used to perform preference learning. The framework can represent partial 
examples under a brave and a cautious semantics. We have also put forward a 
new algorithm, ILASP2, that can solve any ILPloas task for optimal solutions. 

This algorithm extends previous work for solving the simpler task ILPlas and 
resolves some of the scalability issues associated with the previous algorithm. Some 
scalability issues remain, especially when there is a particularly large hypothesis 
space and future work will focus on overcoming these. Current work also addresses 
extending the ILASP algorithm to support noisy examples. 


(Corapi et al. 2012 
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Appendix A The ILASP2 Meta Encoding 

We present here the ILASP2 meta encoding which is omitted from the main paper. 
We first summarise some notation used in the encoding. 

We will write body + (R) and body~(R ) to refer to the positive and negative (re¬ 
spectively) literals in the body of a rule R. Given a program P , weak(P) denotes 
the weak constraints in P and nori-weak(P) denotes the set of rules in P which are 
not weak constraints. 

Definition 8 

For any ASP program P, predicate name pred and term term we will write 
reify(P,pred, term) to mean the program constructed by replacing every atom 
a E P by pred(a, term). We will use the same notation for sets of literals/partial in¬ 
terpretations, so for a set S: reify(S , pred, term) = {pred(atom, term) : atom £ S}. 

Definition 9 

For any ASP program P and any atom a, append {P, a) is the program constructed 
by appending a to every rule in P. 

Definition 10 

Given any term t and any positive example e, cover(e, t) is the program: 

cov(t): -in_as(e) nc , t),. .., in_as(e xnc , t), not in_as(ei XC , t),.. . , not in_as(e“ c , t). 
:-not cov(t). 

The previous three definitions can be used in combination to test whether a 
program has an answer sets which extend given partial interpretations. 

Example 5 

Consider the program P = < ^ ^ > and the partial interpretations I\ = 

l q:-not p. J 

(M,0) and h = <0, {p})- 

The program Q = append (reify (P, in .as, X), as(X)) U {as(asl), as(as2)} U 
cover(I\, asl) U cover(l 2 , as 2) has the grounding: 

in_as(p, asl) not in_as(q, asl), as(asl). 

in_as(q, asl) not in_as(p, asl), as(asl). 

in_as(p, as2) :- not in_as(q, as2), as(as2). 

in_as(q, as2) :- not in_as(p, as2), as(as2). 

as(asl). 
as(as2) . 

cov(asl) in_as(p, asl). 

:- not cov(asl). 

cov(as2) :- not in_as(p, as2). 
not cov(as2). 

Without the two constraints, Q would have 4 answer sets (the combinations of 
asl and as2 corresponding to the two answer sets of P). With the two constraints, 
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the answer set represented by asl must extend I\, and the answer set represented 
by as2 must extend I 2 . Hence, there is only one answer set of Q, {as(asl), as(as2), 
in_as(p,asl), in_as(q, as2), cov(asl), cov(as2)}. 

Note that the answer sets of P which extend Ii and I 2 ({p} and {q}) can be 
extracted from the in-as atoms in this answer set of Q. If there were multiple 
answer sets of P extending one or more of the partial interpretations, then there 
would be multiple answer sets, representing all possible combinations such that the 
constraints are met. 

Definition 11 

Let p\ and P 2 be distinct predicate names and 1 be a term. Given R , a weak 
constraint bi,..., b m , not ci, ...,not ci. [wt@lev, ti,..., t n ], meta wea k(R , Pi,P 2 , t) is 
the rule: 

w(wt, lev, args(ti, . . . , t n ), t) : -p 2 (X), pi(bi, t), .. . ,pi(b m , t),not pi(ci, t), . . . , not pi(ci, t) . 
For a set of weak constraints W, meta wea k( W, pi, P 2 , t) is the set 
{meta wea k(R,pi,P 2 ,t) \ R G W}. 


Example 6 

Consider the program P containing the two weak constraints: 

p(V) . [102, V] 
q(V). [201, V] 

meta wea k{P, in-as, as, X) is the program: 

w(l, 2, args(V), X) as(X), in_as(p(V), X). 

w(2, 1, argsCV), X) as(X), in_as(q(V), X). 

Note that for any program P, if we reify an interpretation I = {a!,..., a n } as 
{in_as(ai, id),..., in_as(a n , id)} (the set reify(l, in .as , id)) then the atoms 
w(wt, l, args(ti, ..., t m ), id) in the (unique) answer set of 
meta wea k(weak(P), in-as, as, X) U reify (I, in-as, id) U {as(id)} correspond exactly 
to the elements (wt, l, t\,... t m ) of weak(P, I). 

For example, consider the interpretation I = {p(l),p(2), g(l)}. The unique an¬ 
swer set of meta wea k{weak(P), in-as, as, X)U reify(I, in-as, *d)U{as (id)} is { as(id), 
in_as(p(l), id), in_as(p(2), id), in_as(q(l), id), w(l, 2, args(l), id), w(l, 2, args(2), id), 
w(2,1, args(l), id)}. In this case, weak{P, I) = {(1, 2,1), (1, 2, 2), (2,1,1)}. 

Now that we have defined the predicate w to represent weak(P, A) for each answer 
set A, we can use some additional rules to determine, given two interpretations, 
whether one dominates another. 

Definition 12 

Given any two terms tl and t2, dominates(tl, t2) is the program: 

! dom_lv(tl, t2, L): -lv(L), #sum{w(W, L, A, tl) = W, w(W, L, A, t2) = -W} <0. 
non_dom_lv(tl,t2,L):-lv(L),#sum{w(W,L,A,t2) = W, w(W, L, A, tl) = -W} < 0. 
non_bef (tl, t2, L): -lv(L), lv(L2), L < L2, non_dom_lv(tl, t2, L2). 
dom(tl, t2):-dom_lv(tl, t2, L), not non_bef (tl, t2, L). 
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The intuition is that dom(idl, id2) (where idl and id2 represent two answer sets 
Ai and A 2 ) should be true if and only if A\ dominates A^. This is dependent on 
the atoms dom_lv(idl, id2,1), which for each level l, should be true if and only if 
P A < P l A2 ; non_dom_lv(idl, id2,1), which for each level l, should be true if and only 
if P l A2 < P l A ; and finally, non_bef (idl, id2,1), which for each level l, should be true 
if and only if there is an I 2 > l such that P l J 2 < P A . 

Example 7 

Consider again the program P and interpretation I from example [G] Consider also 
an additional interpretation P = {p(l), p( 2), p( 3)}. weak(P , P) = {(1, 2,1), (1, 2, 2), 
(1,2,3)}. 

The unique answer set of meta wea k(weak(P ), in-as, as, X ) U reify (I, in-as, id 1) U 
reify (P, in-as, id2) U {as(idl), as(id2), lv(l), lv(2)} U dominates (idl, id2) contains 
dom_lv(idl, id2,2), because I dominates I' at level 2 (i.e. Pj < Pf,)', contains 
non_dom_lv(idl, id2,1), because V dominates I at level 1; does not contain any 
non-bef atoms, because the only level at which P dominates I is 1, which is not 
evaluated “before” any other level (it is the lowest level in the program); and finally, 
does contain dom(idl, id2) because / dominates /' at level 2 and there is no level 
“before” (higher than) level 2 at which P dominates I. The presence of dom(idl, id2) 
in the answer set indicates that / dominates V. 

Similarly, the unique answer set of meta wea k(weak(P), in-as, as, X) U 
reify(I, in-as, id 1) U reify(P, in-as, id2) U (as(idl), as(id2), lv(l), lv(2)} U 
dominates(id2, idl) contains dom_lv(id2, idl, 1), because P dominates I at level 1; 
contains non_dom_lv(id2, idl, 2), because I dominates P at level 2; contains 
non_bef (id2, idl, 1) as / dominates P at level 2, which is evaluated “before” level 1; 
and finally, does not contain dom(id2, idl) because there is no level l in the program 
such that P dominates I at l and / does not dominate P at any higher level. 

A.l Encoding the search for positive hypotheses: T meta 

We now use the components described in the previous section to define a program 
Tmeta whose answer sets correspond to the positive solutions of an ILPloas task 
T. 

Definition 13 

Let T be the ILPloas task (B, Sm, E + , E~, O b , O c ). Then T meta = meta(B) U 
meta(SM ) U meta(E + ) U meta(E~) U meta(O b ) U meta(O c ) where each meta com¬ 
ponent is as follows: 

• meta(B) = append(reify(non-weak(B), in-as, X), as(X)) 

U meta wea k(weak(B ), in_as, as, X). 

• meta(SM) = 

{append(append(reify(R, in-as, X), as(X)), in-h(Rid)) \ R £ non-weak(S m )} 
U {append ( W, inJi( Wu)) \ W £ meta wea k(weak(SM), in-as, as, X)} 

U in_h(Ri d ). [2 * |R|@0, R id ] | R £ S M } 

U { (in_h(Rid) : R £ S„}. } 
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• meta(E + ) 


• meta(E ) 


• meta(O b ) 


• meta(O c ) 


(e inc , e exc ) G E+ 


U 


cover(e, e id ) 
as(e id ). 

v_i : -in_as(ei nc , n), . . . , in_as(e xnc , n), 
not in_as (ei IC , n),..., 
not in_as (e® xc , n). 
as(n). 

violating:-v_i. 

not violating. [l@ 0 ] 
as(oidi). as(o id 2). 
cover(e 1 , o idl ) 
cover(e 2 , o id2 ) 
dominates{oidi , o zd2 ) 

: -not dom(oidi, 0^2) • 
dominates (ei, 62 ) 

V -P( e idi e id) : ” 

not dom(ei,e2). 


Je mc , e exc ) G E~ 


O = (ei, e 2 ) G O b 


(ei, e 2 ) G O c >U 


>U{lv(l). | l G L} 


v_p: -v_p(Tl, T2). 
violating: -v_p. 


The intuition is that the in_h atoms correspond to the rules in the hypothesis. 
Each rule R G Sm has a unique identifier Ri d and if inJi(R id ) is true then R is 
considered to be part of the hypothesis H. These in_h atoms have been added to 
the bodies of the rules in the meta encoding so that a rule R G Sm only has an 
effect if it is part of H. 

Each of the terms t for which there is a fact as(t) represents an answer set of 
B U H. As in the previous section the cover program cover(I , t) is used to enforce 
that some of these answer sets extend particular partial interpretations. 

There is one as(t) atom for each positive example e. The cover program is used to 
ensure that the corresponding answer set does extend e. There are two as(t) atoms 
for each brave ordering (ei, e 2 ). Two instances of the cover program are used to 
ensure that the first answer set extends e± and the second answer set extends e 2 . 
We also use the dominates program from the previous section and a constraint to 
ensure that the first answer set dominates the second (hence the ordering is bravely 
respected). 

For the negative examples and the cautious orderings, the aim is to generate 
violating in at least one answer set of the meta encoding corresponding to H , if H 
is indeed a violating solution (generating v_i if H does not cover a negative example 
and some instance of v_p if it does not respect a cautious ordering). 

Firstly, for the negative examples, we have an extra fact as(n). As we have no 
constraints on the answer set of B U H which this can correspond to, the intuition 
is that there is one answer set of the meta encoding for each answer set of B U H . 
For each negative example e~ there is a rule for v_i which will generate v_i if the 
answer set corresponding to as(n) extends e~. 

For the cautious orderings we use a similar approach. For any cautious ordering 
(ei, e 2 ), as e\ and e 2 are positive examples, there are already two as(t) atoms which 
represent answer sets extending each of these interpretations; in fact, there will be 
one answer set of the meta encoding for each possible pair of answer sets of B U H 
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which extend these interpretations. Therefore, by using the dominates program, 
and generating a v_p atom if the answer set of B U H corresponding to the first 
as(t) atom does not dominate the answer set of B U H corresponding to the second 
as(t) atom in any answer set of the meta encoding, we ensure that violating will 
be true in at least one answer set of the meta encoding which corresponds to H. 

Example 8 


Consider the learning task: 


r <{ P ( 2 )}, 0 ), 


p(V):-r(V),n°t q(V). 
q(V) : _r (V), not p(V). 

H 

<0, {p(2)}>, 

<{a},{b}}, 

B = < 

r(l). r(2). 


l <0,{a}) 


a:-not b. 

E~ = { ({ P (l)},0> } 


, b:-not a. 

O b = { 

(4,4) } 

S M = 

| q(l). | 

< :~q(V).[i@l,V,r2] 

[ :~b.[l®l,b,r3] J 

O c = { (e+, 4) } 


Figure IXT1 shows T meta ■ There are two optimal positive hypotheses (one contain¬ 
ing each of the two weak constraints). 

The positive hypothesis b ■ [1@1, b, r3] has a violating interpretation {p(l), q(2), 
r(l), r(2), a}. This corresponds to the following answer set of T met a' 

{ as(l), as(2), as(3), as(4), as(n), as(5), as(6), lv(l) , in_as(r(l),1), 
in_as(r(l),2), in_as(r(l),3), in_as(r(l),4), in_as(r(l),n), in_as(r(l),5), 
in_as(r(l),6), in_as(r(2),1), in_as(r(2),2), in_as(r(2),3), in_as(r(2),4), 
in_as(r(2),n), in_as(r(2),5), in_as(r(2),6), in_as(q(l),3), in_as(q(l),4), 
in_as(q(l),6), in_as(p(l),1), in_as(p(l),2), in_as(p(l),n), in_as(p(l),5), 
in_as(p(2),1), in_as(q(2),2), in_as(q(2),3), in_as(q(2),4), in_as(q(2),n), 
in_as(q(2),5), in_as(q(2),6), in_as(a,l), in_as(a,2), in_as(a,3), 
in_as(b,4), in_as(a,n), in_as(a,5), in_as(b,6), in_h(r2), 
w(l,1,args(2,r2),2), w(l,1,args(1,r2),3), w(l,1,args(2,r2),3), 
w(l,1,args(1,r2),4), w(l,1,args(2,r2),4), w(l,1,args(2,r2),n), 
w(l,1,args(2,r2),5), w(l,1,args(1,r2),6), w(l,1,args(2,r2),6), cov(l), 
cov(2), cov(3), cov(4), v_i, violating, dom_lv(l,2,1), dom(l,2), cov(5), 
cov(6), dom_lv(5,6,1), dom(5,6) } 

Note that the violating interpretation can be extracted from the in_as(_, n) atoms 
and the hypothesis can be extracted from the in_h atoms. 

Similarly, the violating pair ({p(2), q(l), r(l), r(2), a}, {p(l), q(2), r(l), r(2), a}) can 
be extracted from the answer set: 

{ as(l), as(2), as(3), as(4), as(n), as(5), as(6), lv(l), in_as(r(l),1), 
in_as(r(l),2), in_as(r(l),3), in_as(r(l),4), in_as(r(l),n), in_as(r(l),5), 
in_as(r(l),6), in_as(r(2),1) , in_as(r(2),2), in_as(r(2),3), in_as(r(2),4), 
in_as(r(2),n), in_as(r(2),5), in_as(r(2),6), in_as(q(1),1), in_as(q(l),4), 
in_as(q(l),6), in_as(p(l),2), in_as(p(l),3), in_as(p(l),n), in_as(p(l),5), 
in_as(p(2),1), in_as(q(2),2), in_as(q(2),3), in_as(q(2),4), in_as(q(2),n), 
in_as(q(2),5), in_as(q(2),6), in_as(a,l), in_as(a,2), in_as(a,3), 
in_as(b,4), in_as(a,n), in_as(a,5), in_as(b,6), w(l,1,args(1,r2),1), 
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in_h(r2), w(l,1,args(2,r2),2), w(l,1,args(2,r2),3), w(l,1,args(l,r2),4), 
w(l,1,args(2,r2),4), w(l,1,args(2,r2),n), w(l,1,args(2,r2),5), 
w(l,1,args(l,r2),6), w(l,1,args(2,r2),6), cov(l), cov(2), cov(3), cov(4), 
v_i, violating, v_p(l,2), v_p, cov(5), cov(6), dom_lv(5,6,1), dom(5,6) } 


7. meta(B) 

in_as(p(V),X) in_as(r(V),X), 

not in_as(q(V),X), as(X). 
in_as(q(V),X) in_as(r(V),X), 

not in_as(p(V),X), as(X). 
in_as(r(l),X) as(X). 

in_as(r(2),X) as(X). 

in_as(a,X) not in_as(b,X), as(X). 

in_as(b,X) not in_as(a,X), as(X). 

7 0 meta(S_M) 

in_as(q(l),X) as(X), in_h(rl). 

w(l,1,args(V,r2),X) in_as(q(V),X), 

as(X), in_h(r2). 

w(l,1,args(b,r3),X) in_as(b,X), 

as(X), in_h(r3). 

0 {in_h(rl), in_h(r2), in_h(r3)} 2. 

: ~ in_h(rl) . [200,rl] 
in_h(r2).[200, r2] 

:~ in_h(r3).[200,r3] 


7. meta(E~+) 



as(l) . 




as(2) . 




as(3) . 




as(4) . 




cov(l) 

: - in_ 

as (p(2) 

,1). 

cov(2) 

: - not 

in_as(p(2) , 

n 

o 

<1 

CO 
'—✓ 

: - in_ 

as(a,3) 

, not 

o 

o 

< 

'—✓ 

: - not 

in_as( 

a,4) . 

: - not 

cov(l) 



: - not 

cov(2) 



: - not 

cov(3) 



: - not 

cov(4) 




7, meta(E~-) 

v_i in_as(p(l),n). 

as(n). 

violating v_i. 

not violating. [100, violating] 

7. meta(0~b) 
as(5) . 
as(6) . 

cov(5) in_as(a,5), not in_as(b,5). 

cov(6) not in_as(a,6). 

:- not cov(5). 

:- not cov(6). 
dom_lv(5,6,L) lv(L), 

#sum{w(W,L,A,5)=W, w(W,L,A,6)=-W> < 0 
wrong_dom_lv(5,6,L) lv(L), 

#sum{w(W,L,A,6)=W, w(W,L,A,5)=-W> < 0 
wrong_bef(5,6,L) lv(L), L < L2, 
wrong_dom_lv(5,6,L2). 
dom(5,6) dom_lv(5,6,L), 

not wrong_bef(5,6,L). 

:- not dom(5,6). 

1 v (1) . 

7. meta(0~c) 

dom_lv(l,2,L) lv(L), 

#sum{w(W,L,A,1)=W, w(W,L,A,2)=-W> < 0 
wrong_dom_lv(l,2,L) lv(L), 

#sum{w(W,L,A,2)=W, w(W,L,A,l)=-W> < 0 
wrong_bef(1,2,L) lv(L), L < L2, 
wrong_dom_lv(1,2,L2). 
dom(l,2) dom_lv(l,2,L), 

not wrong_bef(1,2,L). 
v_p(l,2) not dom(l,2). 

v_p :- v_p(X,Y). 
violating v_p. 


Fig. Al: An example of T meta ■ 
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A.2 Encoding classes of violating hypotheses: VR me t a 

The previous section set out how to construct a meta level ASP program T meta 
whose answer sets correspond to the positive hypotheses. It is also able to identify 
violating hypotheses, but has no way of eliminating them (as only a subset of 
the meta level answer sets corresponding to a violating hypothesis will identify it 
as violating, so eliminating these answer sets does not necessarily eliminate the 
hypothesis). 

We therefore present a new meta level program which, combined with T meta , 
eliminates any hypothesis which is violating for a given set of violating reasons. As 
violating reasons are full answer sets (or pair of full answer sets), we do not have 
to generate answer sets. It is enough to check whether the answer sets we have as 
part of the violating reasons are still answer sets of B U H. The standard way to 
do this is check whether these answer sets are the minimal model of the reduct of 
B U H with respect to this answer set. The next two definitions define a meta level 
program which, given an interpretation, computes the minimal model of the reduct 
of a program with respect to that interpretation. This reduct construction is close 
to the simplified reduct for choice rules from (ILaw et al. 2015cl) . 

Definition 14 

Given any choice rule R = l{ln,... ,h n }u:-body, reductify(R) is the program: 
mmr(hi, X): -reify(body + , mmr, X), reify(body~ , not in_vs,X), ' 
l{in_vs(hi, X),. . . , in_vs(h n , X)}u, in_vs(hi, X)- 

mmr(h n , X): -reify(body + , mmr, X), reif y(body~, not in_vs, X), 

< l{in_vs(hi, X),. . . , in_vs(h n , X)}u, in_vs(h n , X)- > 

mmr(_L, X): -reify(body + , mmr, X), reify(body~ , not in_vs, X), 
u + l{in_vs(hi, X),. .. , in_vs (h n , X)} • 
mmr(_L, X): -reify(body + , mmr, X), reify(body _ , not in_vs, X), 

„ {in_vs(hi, X),. .. , in_vs(h n , X)}1 — 1- 


Definition 15 

Let P be an ASP program such that Pi is the set of normal rules in P, P 2 is the 
set of constraints in P and P 3 is the set of choice rules in P. 

mmr(head(R), X): -reify(body + (R), mmr, X), 
reify(body” (R), not in_vs, X), vs(X). 
mmr(_L, X): -reify(body + (R), mmr, X), 

reify(body~ (R), not in_vs, X), vs(X). 

U {reductify(R) \ R € P 3 }- 


reductify{P) = 


R € Pi 


U 


R e i\ 


Example 9 

Consider the program P = 


p:-not q. 
q:-not p. 

mmr(p, X): -not in_vs(q, X), vs(X). 
mmr(q, X): -not in_vs(p, X), vs(X). 

We can check whether {p} is an answer set by combining reductify(P) with 


reductify(P) = 










Learning Weak Constraints in Answer Set Programming 


23 


{vs(vsl), in_vs(p, vsl)}. The answer set of this program is {vs(vsl), in_vs(p, vsl), 
mmr(p, vsl)}, from which the minimal model, {p}, can be extracted. This shows that 
{p} is indeed an answer set of P. 


Definition 16 

Let T be the ILPloas task ( B, Sm, E + , E~, O b , O c ) and VR be the set of violating 
reasons VI U VP, where VI are violating interpretations and VP are violating pairs. 

VRmeta(T) is the program meta(VI) U meta(VP) U meta(Aux) where the meta 
components are defined as follows: 


meta( VI) = 


meta(VP) = < 


reify (I, in.vs,Ii d ) 

:-not nas(lid). I £ VI 
vs(lid). 
dominates(vpidi, vpid 2 ) 
reify{h, in_vs, vp idl ) 
reify(h, in.vs, vp id 2 ) 

VS (vpidl) . 
vs(vp id 2 ) ■ 

:-not nas(vpidi), not nas(vpid 2 ), 
not domfvpidi, vp id 2 ) ■ 


• meta(Aux) = 
reductify(B) 
nas(X) 


U 


vp = {h,I 2 ) £ VP 


in_vs(AT0M, X), not mmr(AT0M, X). 
nas(X): -not in_vs(AT0M, X), mmr(AT0M, X). 


U { append (reductify(R), in-hyp(Ri d )) \ R £ non-weak(SM )} 

U {append (meta wea k{W , injvs, vs, X), inJiyp(Wi d )) \ W £ weak(SM)} 
U {meta wea k(W, in-vs, vs, X) \ W £ weak(B)} 

U (lv(l). | l £ L} 


This meta encoding uses the reductify program to check whether the various 
interpretations in each violating reason is still an answer set of B U H. There is a 
constraint for each of the violating interpretation, ensuring that it is no longer an 
answer set of BUH. Similarly, there is a constraint for each violating pair that says, 
if both interpretations are still answer sets of BUH, then the first must dominate 
the second. This is checked by using the dominates program as before (the weak 
constraints are also translated as before). 


Example 10 

Recall B, Sm, E + , E~, O b and O c from example [H] and let VI be the set contain¬ 
ing the violating interpretation (p(l),p(2), r(l), r(2), a}, VP the set containing the 
violating pair (|p(2), q(l), r(l), r(2), a}, {q(l), q(2), r(l), r(2), a}) and let VR be the 
set of violating reasons VI U VP. Then figure IX2l shows VR meta {T). 

Now that we have ruled out any hypothesis with these violating reasons, one 
optimal answer set of this program is: 

{ as(l), as(2), as(3), as(4), as(n), as(5), as(6), lv(l), in_vs(p(l),vl), 
in_vs(p(2),vl), in_vs(r(l),vl), in_vs(r(2),vl), in_vs(a,vl), vs(vl), 
in_vs(p(2),v2), in_vs(q(l),v2), in_vs(r(l),v2), in_vs(r(2),v2), in_vs(a,v2), 
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vs(v2), in_vs(q(l),v3), in_vs(q(2),v3), in_vs(r(l),v3), in_vs(r(2),v3), 
in_vs(a,v3), vs(v3), in_as(r(1),1), in_as(r(l),2), in_as(r(l),3), 
in_as(r(l),4), in_as(r(l),n), in_as(r(l),5), in_as(r(l),6), in_as(r(2),1), 
in_as(r(2),2), in_as(r(2),3), in_as(r(2),4), in_as(r(2),n), in_as(r(2),5), 
in_as(r(2),6), in_as(q(l),1), in_h(rl), in_as(q(l),2), in_as(q(l),3), 
in_as(q(l),4), in_as(q(l),n), in_as(q(l),5), in_as(q(l),6), in_as(p(2),1), 
in_as(q(2),2), in_as(p(2),3), in_as(q(2),4), in_as(p(2),n), in_as(p(2),5), 
in_as(q(2),6), in_as(a,1), in_as(a,2), in_as(a,3), in_as(b,4), in_as(a,n), 
in_as(a,5) in_as(b,6), w(l,1,args(1,r2),1), in_h(r2), w(l,1,args(l,r2),2), 
w(l,l,args(2,r2),2), w(l,1,args(l,r2),3), w(l,1,args(l,r2),4), 
w(l,l,args(2,r2),4), w(l,1,args(l,r2),n), w(l,1,args(l,r2),5), 
w(l,1,args(l,r2),6), w(l,1,args(2,r2),6), cov(l), cov(2), cov(3), cov(4), 
w(l,1,ts(l),v2), w(l,1,tsCl)>v3), w(l,1,ts(2),v3), dom_lv(l,2,1). dom(l,2), 
cov(5), cov(6), dom_lv(5,6,1), dom(5,6), mmr(r(1),vl), mmr(r(1),v2), 
mmr(r(1),v3), mmr(r(2),vl), mmr(r(2),v2), mmr(r(2),v3), mmr(p(l),vl), 
mmr(p(2),vl), mmr(p(2),v2), mmr(q(l),v2), mmr(q(l),v3), mmr(q(2),v3), 
mmr(a,vl), mmr(a,v2), mmr(a,v3), mmr(q(1),vl), nas(vl), dom_lv(v2,v3,1). 
dom(v2,v3) } 

This answer set corresponds to the hypothesis: 
q(l). 

q(V).[101,V,r2] 

As the optimality of this answer set is 5, we know that this hypothesis cannot have 
any violating reasons, as otherwise, there would be an answer set with optimality 
4 corresponding to the hypothesis (which would have been returned as optimal). 
Hence, the hypothesis must be an optimal inductive solution of the task. 
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7. meta(VI) 
in_vs(p(l),vl). 
in_vs(p(2),vl). 
in_vs(r(l),vl). 
in_vs(r(2),vl). 
in_vs(a,vl). 
vs(vl). 

:- not nas(vl). 

7, meta(VP) 
in_vs(p(2),v2). 
in_vs(q(l),v2). 
in_vs(r(l),v2). 
in_vs(r(2),v2). 
in_vs(a,v2). 
vs(v2). 

in_vs(q(l),v3). 
in_vs(q(2),v3). 
in_vs(r(l),v3). 
in_vs(r(2),v3). 
in_vs(a,v3). 
vs(v3). 

dom_lv(v2,v3,L) lv(L), 
Ssum-CwCW.L, A, v2)=W, 

w(W,L,A,v3)=-W> < 
wrong_dom_lv(v2,v3,L) 


#sum-[w (W, L, A, v3) =W, 

w(W,L,A,v2)=-W} < 0. 
wrong_bef(v2,v3,L) lv(L), L < L2, 
wrong_dom_lv(1,2,L2). 
dom(v2,v3) dom_lv(v2,v3,L), 
not wrong_bef(v2,v3,L). 

not nas(v2), not nas(v3), 
not dom(v2,v3). 

7. meta(Aux) 

mmr(p(V),X) mmr(r(V),X), 
not in_vs(q(V),X), vs(X). 
mmr(q(V),X) mmr(r(V),X), 
not in_vs(p(V),X), vs(X). 
mmr(r(l),X) vs(X). 

mmr(r(2),X) vs(X). 

mmr(a,X) not in_vs(b,X), vs(X). 

mmr(b,X) not in_vs(a,X), vs(X). 

mmr(q(l),X) vs(X), in_h(rl). 

w(l, 1,ts(V) ,X) vs(X), 
in_vs(q(V),X), in_h(r2). 
w(l,1,args(b,r3),X) vs(X), 
in_vs(b,X), in_h(r3). 

0. nas(X) in_vs(A,X), not mmr(A,X). 

lv(L), nas(X) not in_vs(A,X), mmr(A,X). 


Fig. A2: An example of VR me t a (T). 



